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We exploit a result by Nerman which shows that conditional limit theorems hold when a certain 
monotonicity condition is satisfied. Our main result is an application to vertex degrees in random 
graphs, where we obtain asymptotic normality for the number of vertices with a given degree in 
the random graph G{n, m) with a fixed number of edges from the corresponding result for the 
random graph G{n,p) with independent edges. We also give some simple applications to random 
allocations and to spacings. Finally, inspired by these results, but logically independent of them, 
we investigate whether a one-sided version of the Cramer- Wold theorem holds. We show that 
such a version holds under a weak supplementary condition, but not without it. 
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1. Introduction 

Many random variables in difTcrcnt areas of probability, statistics and combinatorics 
can be expressed as some "simpler" random variable conditioned on a specific value of 
another. A few examples are given in Sections 3 and 4 below; many other can be found 
in the references. 

Such representations are, among other things, useful for the derivation of asymptotic 

results. Generally speaking, if {Xn,Yn) ---^ would like to conclude that the 

conditional distributions also converge, that is, (X„ I y„ = y)^{X\Y = y). Of course, 
this is not true in general, but it holds in many cases and several authors have proved 
more or less general theorems of this type under various assumptions; see, for example, 
Steck [28], Hoist [9, 11], Janson [17] and the further references given there. (For a trivial 
counterexample which shows that some assumptions are needed, let P(X„ = y„ = 0) = 
1/n and P(A:„ = 1, y„ = 1/n) = 1 - 1/n, with AT = 1 and F = 0. Then, (A„ ] y„ = 0) = 0, 
but (A 1 r = 0) = 1 a.s.) 

The purpose of the present paper is to exploit a result of Nerman [23] which shows 
that such conditional limit theorems hold in the special, but not uncommon, situation 
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that a certain monotonicity condition holds. This result seems to have been somewhat 
neglected, but it has many applications. We illustrate its power first by some simple 
applications to random allocations. Our main result is an application to vertex degrees 
in random graphs, where we derive a new result for the random graph G{n, m) with 
a fixed number of edges from the corresponding (but slightly different) result for the 
random graph G{n,p) with independent edges; indeed, it was this problem that led to 
the present paper. The method also applies to other properties of G(n, m) and we mention 
some of these. 

Wc state Nerman's theorem, in versions suitable for easy applications, in Section 2. 
In Section 3, we illustrate the theorems by a simple application to random allocations, 
where we give short proofs of some known results. We present our main application to 
random graphs in Section 4, where we state and prove Theorem 4.1 on vertex degrees. 
This also illustrates how results for other monotone functions of random graphs may be 
obtained. An application of Theorem 4.1 to the study of the fc-core is given in [18]; this 
application was the original motivation for the research that led to the present paper. 

We provide another simple application, to spacings, in Section 5. 

It often happens that vector-valued versions of limit theorems follow from the one- 
dimensional versions by the Cramer-Wold device, that is, by considering linear com- 
binations of the components; see Cramer and Wold [6] or, for example, Billingsley [4], 
Theorem 7.7. This is not the case here, since the assumptions in the multidimensional case 
of Theorem 2.2 allow us to apply the one-dimensional case only to linear combinations 

tiXn'' H \-tdX^\ where all ti have the same sign. Although not needed for our results 

(the multidimensional case is proved by Nerman as the one-dimensional case) , wc find it 
interesting to investigate whether, in general, it is enough to show convergence for such 
linear combinations, that is, whether there is a one-sided version of the Cramer- Wold 
device. In Section 6, wc show that the answer is affirmative under a weak supplementary 
condition, but not in general. This is equivalent to a corresponding uniqueness problem 
and to the question of whether a characteristic function is determined by its restriction 
to the first octant. 

All unspecified limits below are as n — > oo. Further, as usual, 5ij is 1 when i= j and 
otherwise. 

2. Nerman's general results 

Definition. Let X and Y be random variables defined on the same probability space. We 
say that X is stochastically increasing with respect to Y if the conditional distributions 
C{X \ Y — y) are stochastically increasing in y, that is, if 

f{X <x\Y = yi) 

(2.1) 

> P[X < X \ Y = y2) for any real x and yi < 2/2- 

// Y has a discrete distribution, we may, here and below, consider only y ( and yi,y2) 
such that P(F = j/) > and there is no problem with defining the conditional distributions 
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and probabilities. In general, for example, for continuous Y , the conditional distribution 
C{X \ Y = y) is defined only up to equivalence, that is, for a.e. y with respect to the 
distribution C{Y). The precise definition is that there exists a version of y > C{X \ Y ^ y) 
that is stochastically increasing in y; it is this version that is used below. (Thus, (2.1) 
holds with the conditional probabilities defined by this version.) 

We say that X is stochastically decreasing with respect to Y if —X is stochastically 
increasing, and stochastically monotone with respect to Y if it is either stochastically 
increasing or stochastically decreasing with respect to Y . 

The definitions extend to vector- valued X and Y , using the partial order on defined 
by (si, ...,Sd)<{ti,...,td) if Si < ti for each i. 

Remark 2.1. It is well known that if X is real-valued, then (2.1) is equivalent to the 
existence of an increasing coupling of {X \ Y = yi) and (X | F = t/2), that is, a pair of 

random variables Xi and X2 such that Xj = {X \ Y = yj), j = Q,\, and Xi < X2 a.s. This 
is not generally true in the vector-valued case, but the (perhaps more natural) condition 
that there always exists such an increasing coupling is stronger and implies (2.1). 

We can now state Nerman's theorem. Let 'P{M.'') denote the set of probability measures 
on R'', equipped with the usual (weak) topology. 

Theorem 2.2. Suppose that {X„,Y„), n > 1, are pairs of random vectors, with Xn G 
and Yn G W' for some q,r> \, such that X„ is stochastically monotone with respect to 
Y„ . Further, suppose that for some sequences of real numbers and vectors an > , bn G R'^, 

C„ >0,dn€ M^ 

ia~\X„ - 6„),c-i(r„ - d„)) ^ {X,Y) 

for a pair of random vectors X €zW^ and Y dW . Assume that yn is a sequence in W such 
that c~^(jjn — dn) — > £, for some ^ G R'' and let Xn be a random vector whose distribution 
equals the conditioned distribution C{Xn \ Yn — yn)- Finally, suppose that ^ is an interior 
point of the support of Y and that there exists a version of y t—i- C{X \ Y ~ y) that is 
continuous at y = ^ as a function of y into ^^(R*). Then, 

a-\Xn-bn)^C{X\Y = 0- 

Proof. Nerman [23], Theorem 1 and Remark, proved the case a„ = c,i = 1, 6„ = 0, 
dn = 0. The general version follows immediately by replacing (X„,y„) with (a^^(X„ — 

bn),C-\Yn-dn)). □ 

The case when X and Y have a joint normal distribution is perhaps the most inter- 
esting, both because it appears in many applications and because the result simplifies 
somewhat. In this case, assuming that the covariance matrix of Y is non-singular, it is 
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elementary and well known that there exists a continuous version of y i— *■ C{X \ Y = y), 
given by 

iX\Y^y)^X + A{y~Y), (2.2) 

where A is the q x r matrix given by A := Cov(X,y)(Var(y))^^. Note that this, too, 
has a normal distribution. (To see (2.2), note that Z X — AY and Y are uncorrelated 

and thus independent. Since X = Z + AY, it follows that {X \Y = ij) = Z + Ay = X + 
A{y-Y).) 

For ease of application, we state the result in the normal case separately, restricting 
ourselves to the case r = 1, which simplifies notation and is the most important case for 
applications. 

Theorem 2.3. Suppose that {Xn,Yn), n > 1, are pairs of random vectors Xn = 
{Xn\ ■ ■ ■ ,Xn^) G and variables y^i G K for some q>l, such that X^ is stochastically 
monotone with respect to Yn- Suppose, further, that for some sequences of real numbers 
and vectors a„ > 0, 6„ G R"^, c„ > 0, c?„ G M, 

(a-i(X„ -6„),c,;\r„ - d„)) ^ {X,Y) (2.3) 

for a normally distributed random vector [X, Y) with X = {X^^\ . . . , X*-'-*) G M'^ such that 
Var(y)>0. (Thus, X'^'^\ . . . , X^i\Y are jointly normal.) 

Assume that yn is a sequence in R such that c~^(y„ — dn) £, for some real ^ and 
let Xn = {Xn\---,Xn^) be a random vector whose distribution equals the conditioned 
distribution £(X„ | Yn = yn)- 

Then, withj={j('^\...,j^i'>), w/iere = Cov(xW,r)/ Var(y), 

a-\Xn - bn) -^X:=X + i^- Y)j. 

Thus, X = (X(i) , . . . , X(9)) IS normal with 

EX = EX + (^ - EY)j, 
Cov(x(*) , X^^^ ) = Cov(x(*' , X(J') ) - Cov(X(') , Y) Cov(x(j') , Y) / Var(y ). 

In the one-dimensional case, this result may be stated as follows. 

Corollary 2.4. Suppose that the assumptions of Theorem 2.3 hold with q=\ and thus 
Xn and X real-valued. Let a\ := Var(X), Uy ■= Var(y), axY ■= Cov{X,Y) and 7 := 
CTjcy/cy- Then, 

a-\Xn - bn) N{EX + 7(e - EY),aj, - aj^y/a^). (2.4) 

//, further, a\ > 0, then the asymptotic variance in (2.4) equals (1 — p^)cr^, where p := 
o'xY /{o'xO'y) is the correlation between X and Y. 
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The variance in (2.4) is the same as the residual variance in linear regression. This 
extends to the multi-dimensional case. 

We can weaken the assumptions in the multi-dimensional case somewhat. 

Corollary 2.5. In Theorems 2.2 and 2.3, we can replace the assumption that Xn is 
stochastically monotone with respect to Yn by the assumption that TXn is stochastically 
monotone with respect to Yn for some invertible linear operator T on . 

Proof. We can apply the theorems to (TX„,y„), with X and 6„ replaced by TX and 
Thn, respectively. The result follows by applying T~^. □ 

We end this section by stating a companion result of Ncrman [23], Section 4, on moment 
convergence. 

Theorem 2.6. If the pth absolute moments of the components of an^{Xn — converge 
to the corresponding moments of X in one of the theorems or corollaries above, then all 
(mixed) moments and absolute moments of order at most p of a~^{Xn — bn) converge to 
the corresponding moments of X . In particular, if the means and (co)variances a^^(X„ — 
bn) converge to those of X , then the means and (co)variances of an^{Xn — bn) converge 
to those of X . 

3. A simple application: random allocation 

Example 3.1. Let m balls be thrown independently of each other into n boxes, with 
probability \/n of landing in each box. Let be the number of balls landing in box fc, 
fc = 1, . . . , n. (Thus, (A'^i, . . . , iV„) has a multinomial distribution.) Let Z„in be the number 
of empty boxes. We are interested in asymptotics as n — » cx) and m = m{n) — > oo. This 
(and various extensions) has been studied by many authors; see, for example, von Mises 
[22], Feller [7], Section IV.2, Arfwedson [1], Weiss [29], Renyi [25], Rosen [26, 27], Hoist 
[12] , Hwang and Janson [14] and the monograph by Kolchin, Sevast'yanov and Chistyakov 
[20]. 

In order to apply the results above, we instead throw a random number M ^ Po(A„n) 
balls. The numbers Nk of balls in the different boxes arc then i.i.d. with Nk ~ Po(A„). 
(See Hoist [12, 13] for similar uses of Poissonization in this and related problems.) We 
let Xn denote the number of empty boxes, let Yn := M and observe that 

n 

(x„,y„) = ^(i[iVfe = o],7Vfe). 

k=l 

The terms in the sum are i.i.d. random vectors with mean (e~'''",A„) and covariance 
matrix given by 

Var(l[7Vfe = 0]) = e-^"(l-e-^"), 
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Var(A^fe) = A„, 
Cov(l[7V, = 0],7Vfe) = -e-^"A„. 

It follows from the central limit theorem that if n — > oo and A„ ^ A > 0, then 

n~'^\X,, - nc-^",y„ - nA„) ^ (X,y), (3.1) 

with X and y jointly normal with mean and (co)variances cr^ = e~'''(l — e^'*'), ay = A, 
axY = -Ac"-^. 

It is obvious that X„ is stochastically decreasing with respect to y„ = M, since throw- 
ing another ball can only decrease the number of empty boxes. Moreover, if we condition 
on Y„ = TO, we are back in the situation of throwing a given number m balls and thus 
Xn = Z„in- As is well known in this and many related situations, we can here take any 
Xn > 0, but for the continuation of the argument, the choice matters. We use the (natural) 
choice A„ =m{n)/n. 

Consequently, if m — m{n) is such that m{n)/n ^ A > 0, then by Corollary 2.4, using 
On c„ = 71,1/2, ^ ne"^", dn = riA„ = m{n), y„ = m(n), ^ = and 7 = (Txy/'^y = 
— e^'^, we have 

^ X - 7y = X + e^^r - N{0, e"^ - 0"^^ - Ae'^^), 

as shown by Weiss [29]; see also Renyi [25] and Kolchin et al. [20], Theorem 1.3.1. 

Corollary 2.4 does not apply directly to the number of boxes with exactly one ball, 
since this is not stochastically monotone with respect to the number of balls. However, 
denoting this number by Zmn, the sum Zmn + Zmn is the number of boxes with at 
most one ball, which is stochastically decreasing with respect to m. Consequently, we 
can argue, as above, for the vector {Zmn, Zmn + Zmn), using Theorem 2.3, and conclude 
joint asymptotic normality for {Z„in, Zmn + Zmn) and thus for {Zmn, Zmli)- This is a 
simple instance of Corollary 2.5. 

More generally, if Zmn is the number of boxes with exactly j balls, then Corollary 2.5 
apphes to {Zmn, Zmn, Z^nl), for any fixed J, with 

T{zo, zi, . . . , zj) = (zq, zq + 2i, . . . , zq H 'r zj). (3.2) 

We again assume that m = m{n) is such that A„ :— m/n^ \ > Q and denote the Poisson 
probabilities by 

\ k 

7rA(fc):=P(Po(A) = fc) = — e-\ (3.3) 

We can take a„ = = n^/^, bn'' = 7''A„(j)?^, dn = yn = m{n) and then (2.3) holds by the 
central limit theorem; in this case, we have, if we let W denote a random variable with 
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VarF = VarVK = A 

and thus 

Cov(X(*),x(^')) ■.^Cov{l[W = i],l[W = j]) 

- Cov(l [VK ^i],W) Cov(l [M^ = j] , W) / Var W (3.4) 

= 6,,7r,{^) - n,{^)n,{J) (^1 + Il^^Kl—^) 
In other words, jointly for aU j >0, 

where X'^^\ j > 0, are jointly Gaussian with means and covariances given by (3.4), as 
shown by other methods in Kolchin et al. [20], Theorem II. 2. 3; see also Bekessy [3]. All 
(mixed) moments converge by Theorem 2.6. 

We have, for simplicity, considered only the case m/n ^ A > 0, but the results are easily 
extended to the cases m/n — > and m/n oo (at appropriate rates). Moreover, we can 
study the case of different probabilities pi , . . . , p„ for the boxes by the same method; 
this generalization is studied in several of the references listed above. We can, further, 
as in some of the references, study sums J^k H^k, k) for other functions h, possibly also 
depending on the box k. In this way, new proofs of several results in the above references 
may be obtained, but we leave these extensions to the reader. 



4. Vertex degrees in random graphs 

Let Qn be the set of all graphs with the n (labelled) vertices 1, . . . , n. Two basic and 
widely studied models of random graphs are known as G{n,m) and G{n,p). G{n,m), 
where < m < (2) , is obtained by choosing an element of Qn with exactly m edges at 
random (uniformly). G{n,p), where 0<p< 1, is defined by making a random choice 
for each pair of distinct vertices and connecting them by an edge with probability p, 
independently of all other edges. Note that the number of edges in G{n,p) is Jii{(^^),p) 
and that G{n,m) can be obtained as G{n,p) conditioned on having exactly m edges, for 
any m and p € (0, 1). See, further, BoUobas [5] or Janson, Luczak and Rucihski [19]. 

It is well known that the two random graph models G{n,p) and G{n, m) are very similar 
and, for many properties and quantities, they show the same asymptotic behaviour (for 
appropriate p = p{n) and m = m{n)). In general, it is usually easy to obtain results for 



Monotonicity, normality and random graphs 



959 



G(n,p) from the corresponding results for G{n,m), but it is often more difficult to go in 
the opposite direction. 

For monotone properties or quantities, the situation is simple and it is possible to draw 
conclusions in both directions. This is well known for thresholds of monotone functions 
and for convergence in probability of monotone quantities. Theorem 2.3 shows that, 
under very general conditions, this also holds for asymptotic normality, although the 
asymptotic variances will usually be different for G{n,p) and G{n,m). 

The fact that the asymptotic variances generally differ is easily seen by proceeding in 
the other direction, from G{n,m) to G{n,p); see Pittel [24]. This is a standard analysis 
of variance argument; the variance for G(n,p) will have an extra term that can be in- 
terpreted as the part of the variance that is explained by the variation in the number 
of edges. In many situations, this term is of the same order as the variance for G{n,m) 
and then the two models will have variances that are different, but of the same order. 
In other cases, one term may dominate the other. If the extra term is dominated by 
the variance for G{n,m), then G{n,m) and G{n,p) have the same asymptotic variance. 
In the opposite case, the asymptotic variance for G{n, m) is of a smaller order than for 
G{n,p). It is easily seen that the latter case is exactly the case when the "first projection 
method" applies for G{n,p); see [19], Section 6.4. In this case, it is not possible to derive 
precise results for G{n,m) from the limit results for G{n,p) (at least not without more 
detailed information). A typical case where our approach to G{n,m) thus fails is the 
number of copies of a given small subgraph H in G{n,p) with constant p and in G{n,m) 
with m = p{^)', if H has v vertices, the variance is, in general, of the order for 
G{n,p) and n'^^~'^ for G{n,m) [15]. 

As an application of Nerman's theorem, we consider the numbers of vertices of different 
degrees in G{n,p) and G{n, m), with p ^ c/n and to ~ cn/2. For G{n,p), it is known that 
these numbers have asymptotic normal distributions; this easily extends to their joint 
distribution. We then use Nerman's theorem to find the same property for G{n,m), but 
with somewhat different asymptotic variances and covarianccs. Recall the notation (3.3). 

Theorem 4.1. (i) Consider G{n,p), where p — p{n) ~ A„/n and A„ ^ A > 0, and let 

Nk = Nk{n) be the number of vertices of degree k, k >0. Then, 

n~'/^{Nk-Trx„{k)n)^Uk, fc > 0, (4.1) 
jointly for all k, with Uk jointly normal with E?7/c = and covariances 



Cov{Uj,Uk) = <Jjk ^A(j>A(fc) ( - 1 ) + 7T^{k)S,k. 



More generally, for any sequence (ari)o° '^f '"^^^ numbers with a,i = 0(^") for some A < 
oo, 



n-i/2(^afciVfe-^afc^A„(fc)n) ^5]afct/fc, (4.2) 

V fc fc J k 
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which is a normal random variable with mean and variance ^ ajakcrjk- 

(ii) The same results hold for G{n,m), where m = m{n) and A„ := 2m/ n ^ A > 0, 
except that Uk is now replaced by Uk with EUk — and 



Cov(C/„C/fc) =7rA(j>A(fc)( _ 1 ) +n^{k)5,k. (4.3) 



Proof. First, consider G{n,\n/'n). It is shown in [2], by Stein's method (see also [19], 
Example 6.35), that each Nk is asymptotically normal. More precisely, 

n-'/\Nk-ENk)^Uk^N{0,akk), (4.4) 

where 

akk:= Mm n-'YaTNk = nxiky(^^—^-l)+TTx{k). (4.5) 
Moreover, with p„ :— X^/n, uniformly in > 0, we have 



k\ 



(4.6) 



Hence, we may replace 'ENk by Tr\^{k)n in (4.4). 

The proof immediately extends to finite linear combinations of Nk, which shows joint 
convergence in (4.4) for all r > 0; the covariances are given by 

a,k ■■= Cov([/„ C/fc) = 7rA(j)7rA(fc) j^ll^^K^^ _ + 7:x{3)5jk. 

Now, assume that Ok = 0{A^) are given real numbers. We claim that for any such ak, 
we have 

oo oo 
fc=0 fc=0 

Indeed, by the joint convergence in (4.4), this holds for the partial sums X^aLo ^"^^ 
finite K . Moreover, from the exact formula for YaiNk (see [19], Example 6.35), it easily 
follows that, for any given B such that sup,j A„ < -B, 

n-^YaTNk^O{B''/kl) 
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oo to obtain (4.7) (cf. [4], Theorem 4.2). 



^ _ (fc)^) A^Y^akUk. (4.8) 

Note, further, that by the same argument, or by the Cramer-Wold device, this holds 
jointly for any finite set of sequences (a^) with Uk = 0{A'^). In particular, since the 
number of edges is M = ^ ^ kN^ and since J2 ^""A^ (k) = A„, we have 

oo 

n-^'^{M - iA„n) -^V := i ^fcC/fc. (4.9) 

A;=0 

We can now transfer this result to G{n,m), with A„ = 2m/n ^ A > 0, as assumed in 
(ii). First, for any J > 0, Corollary 2.5 apphes to X„ = (TVi, . . . , Nj) and Yn = M, with 
T as in (3.2). Hence, we can use Theorem 2.3, with y„ ~ dn ~ m~ A„n/2, and conclude 
from (4.1) and (4.9), which by the above hold jointly, that (4.1) also holds (jointly in 
aU fc > 0) for G{n, m), with Uk replaced by Uk := Uk - (Cov([/fc, V)/YaT(y))V; a simple 
calculation yields (4.3). 

Moreover, a sum akUk with ak increasing is stochastically increasing with respect 
to the number of edges and it further follows from Theorem 2.3 that (4.8) also holds for 
G{n,m), with Uk replaced by Uk, provided ak is increasing and ak = 0{A'^), again with 
joint convergence for several such sequences. However, any sequence ak ~ 0{A'^) is the 
difference of two increasing such sequences and thus this result extends to all sequences 
ak ~ 0{A'') by linearity, which shows that (4.2) also holds for G(n, m), with Uk replaced 
by Uk. □ 

Remark 4-2. It follows from (4.4), (4.5) and (4.6) that the mean and variances converge 
in (4.1) for G{n,p). Hence, by Theorem 2.6, they also converge for G{n,m) (with Uk 
replaced by Uk)- 

Remark 4-3. The limit result in (ii) is the same as for the random allocations in Sec- 
tion 3 with 2m balls. Indeed, we can add the edges to G{n,m) one by one, at random, 
and then the vertex degrees can be described by an allocation model, as in Section 3 
with 2m balls, except that the balls arc now thrown in pairs and wc condition on no pair 
being thrown into the same box and no two pairs being thrown into the same two boxes. 
Our results show, unsurprisingly, that this conditioning does not affect the asymptotic 
distribution of the edges. 

Remark 4- 4- The same argument applies to many other monotone functions of random 
graphs, for example, the size of the largest component. In this case, Pittel [24] proved 
asymptotic normality for both G{n,p) and G(n, m), with p = c/n and m = cn/2; he first 
proved the result for G{n,m) and then obtained the result for G{n,p) as a consequence. 
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Nerman's theorem shows that it is also possible to do the opposite, obtaining the result 
for G(n,m) from a result for G(ri,p), perhaps obtained as suggested in [16]. 

Another example is the size of the /c-core, where asymptotic normality is proved in [18] 
for both G{n,p) and G{n,m), using Theorem 4.1 above. Again, Nerman's theorem shows 
that it suffices to study G{n,p), provided we verify joint convergence with the number 
of edges, which provides an alternative way of treating G{n,m). 



5. Another application: spacings 

To illustrate the versatility of Nerman's theorem, we give another simple application, 
where the random variables Y„ are continuous. 

Consider the n spacings Si, . . . , Sn created by n — 1 i.i.d. random uniformly distributed 
points on (0, 1) or by n such points on a circle of circumference 1. Let a be a fixed positive 
number and let Na be the number of spacings greater than a/n. 

If Ti, . . . ,r„ are i.i.d. Exp(l) random variables, then 

(5*1, ... , Sn) = (^{Ti/n, . . . ^ T, = nj 

and thus 

Cn n \ 

The central limit theorem yields 

/ n n \ 



- n 



i=l 



where iX,Y) is normal with EX = EF = and cr| = c-'^(l - e""), ctxf =06"", erf, = 1. 
Corollary 2.4 yields 

7i-i/2(iv„ - ne"") ^ iV(0, e-'' - e'^'^ - a^e'^'^). 

Moment convergence holds by Theorem 2.6. 

This is a simple case of a theorem by Le Cam [21], where more general sums of the 
form '^j_h{nSi) are treated. The method above applies to all monotone functions h 
such that 'E.hiTiY < oo and more general functions can be treated by taking linear 
combinations. We leave the details to the reader. We can similarly study sums of the 
type h{nSi, . . . ,nSi+rn-i) and obtain, for suitable h, a new proof of the asymptotic 
normahty proved by Hoist [10]. 



Monotonicity, normality and random graphs 



963 



6. A one-sided Cramer— Wold theorem 

The following theorem is a version of the Cramer- Wold theorem [6], [4], Theorem 7.7; 
note that the Cramer-Wold theorem assumes (6.1) for arbitrary ti, . . . , t^, but our version 
only assumes this when ti > 0, which, for example, might be important for applications 
to monotone functions. 

Recall that a random vector is determined by its moments if all mixed moments are 
finite and every random vector with the same mixed moments has the same distribution. 
We will use this assumption in our theorem. Note that the Cramer-Wold theorem uses 
no such assumption, but Example 6.4 below shows that some such condition is necessary 
for our one-sided version. 

Theorem 6.1. Suppose that = ix["\ . . .,xf^), n>l, and X ^ (Xi, . ..,Xd) are 
random vectors in , where d > 1, such that 

Y^tat^ ^iZ'^x, (6-1) 

1=1 i=l 

for all real numbers ti, . . . , id > 0. Suppose, further, that the distribution of X is deter- 
mined by its moments. Then, X*-"^ X . (Hence, (6.1) holds for all real ti, ... ,td.) 

Remark 6.2. A simple sufficient condition for X to be determined by its moments is 
that Ee''!'^''! < oo for every k and some a > 0. 

Proof of Theorem 6.1. Taking ti = 5ik in (6.1), we see that X]^'^ Xk for every fc. 
Ill particular, the sequence (X^"'')„ is tight for each k and thus the sequence (X("^)„ 
of random vectors is tight. Consequently, every subsequence has a subsubsequence that 
converges in distribution and to show X*^"^ X, it suffices to show that if some subse- 
quence converges in distribution to Y, then Y ^ X . 

Hence, assume that y = (Fi, . . . , Yd) is such that Y along some subsequence. 

liti, . . . ,td>0, then J2k ^fc^fc"' J2k ^k^k along the same subsequence and (6.1) shows 
that 

d d 

Y,tkYk = Y,ikXk, ti,...,td>0. (6.2) 

fc=i k=i 

In particular, with t = [ti, . . . ,td), denoting the characteristic function of a random vector 
Z by ifizit) :=Ee'*-^, we have 

ipYiti,...,td)= Ec"-^ = Ee"-^ = ipxiti,...,td), ti,...,td>0. 



The result thus follows from the following lemma. 



□ 
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Lemma 6.3. Suppose that X — {Xi, . . . , Xd) and Y = (Yi, . . . , Yd) are random vectors 
in R'^ , where d>l, such that 

(pxiti,...,td)=ipYiti,.-.,td), ti,...,td>0. (6.3) 

Suppose, further, that the distribution of X is determined by its moments. Then, X = Y. 
(Equivalently, (6.3) holds for all real ti,. . . ,td.) 

An equivalent statement is that if (6.2) holds and X is determined by its moments, 

then X ^Y and thus (6.2) holds for all real ti,...,td. The first octant may here be 
replaced by any other cone with non-empty interior, using a linear change of variables. 

Proof of Lemma 6.3. Let \ <k <n. Taking ti—0 for k. we sec that Lpx^{t) = 

(fiYk (t) for t > and thus for all real t, because ^{—t) = ip{t) . Thus, Yk X^. and E| Y^. |'" = 
E|Xfc|™ < oo for every m > 0. It follows that E|X|"' < oo and E|y|™ < oo for every 
m > 0. Hence, both ipx and ipy are infinitely differentiable in . For any multi-index 
a, (6.3) implies that D"ipxiti, . . . ,td) — D^ipyiti, . . .,td) when ti, . . . ,td> Q and thus, 
by continuity, also when <i = • • • = = 0. Consequently, 

EX" = i-l"li:)"(^jf (0, . . . , 0) = i^l^lZJ^vjy (0, . . . , 0) = EF". 

Thus, X and Y have the same moments and hence X ^Y . □ 

Lemma 6.3 says that if a random vector is determined by its moments, its characteristic 
function is determined by its restriction to the first octant. The following example shows 
that this does not hold for all random vectors (even in two dimensions) without the extra 
condition that X be determined by its moments. This extra condition in Theorem 6.1 
and Lemma 6.3 can presumably be weakened and we leave it as an open problem to 
investigate more fully when a characteristic function is determined by its restriction to 
the first octant. (The above proof shows that Theorem 6.1 holds for such random vectors 
X.) 

Example 6.4- Let U, V and W be independent random variables such that their charac- 
teristic functions (pu, ^v, Vw satisfy (fiuit) = for |i| > 1 and ipv{t) = 'fiwit) for \t\ < 1, 
but V and W do not have the same distribution; see, for example. Feller [8], Sections 
XV. 2 and XV. 2a, for examples of such random variables. Define X = {U + V,U —V) and 
Y ={U + W,U -W). Then, 

ipxiti,t2) =ipuiti + t2)vviti -t2), 

^Y{tl,t2) =ipu{tl +t2)^pw{tl -h)- 

If ti,t2 > 0, then either ti + 12 > I and (pu{ti + ^2) = 0, or \ti — t2\ < ti + 12 < I and 
(pv{ti — ^2) = 'Pwiti — h)', in both cases, (px{ti,t2) ~ 'PY{ti,t2), so (6.3) holds. Never- 
theless, X and Y do not have the same distribution since Xi — X2 = 2V and Yi — Y2 = 21^ 
have different distributions. Hence, Lemma 6.3 is not true without the extra condition. 
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It further follows that if ti,i2 > 0, then tiXi + 12X2 and tiYi + t2Y2 have the same 
characteristic function and thus the same distribution. Define X^") = Y for all finite n. 

It follows that (6.1) holds (with equahty for aU n) for ti,t2 > 0, but /> X. (Alter- 
natively, let = y for odd n and = X for even n.) This shows that Theorem 6.1 
also fails without the extra condition. 
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